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Abstract: We show how the recently proposed XCone jet algorithm [1] smoothly inter¬ 
polates between resolved and boosted kinematics. When using standard jet algorithms to 
reconstruct the decays of hadronic resonances like top quarks and Higgs bosons, one typically 
needs separate analysis strategies to handle the resolved regime of well-separated jets and the 
boosted regime of fat jets with substructure. XCone, by contrast, is an exclusive cone jet 
algorithm that always returns a fixed number of jets, so jet regions remain resolved even when 
(sub)jets are overlapping in the boosted regime. In this paper, we perform three LHC case 
studies—dijet resonances, Higgs decays to bottom quarks, and all-hadronic top pairs—that 
demonstrate the physics applications of XCone over a wide kinematic range. 
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1 Introduction 

The field of jet substructure has matured significantly over the past five years [2-5], with 
a variety of techniques in active use at the Large Hadron Collider (LHC) to tag boosted 
hadronic objects. A prototypical example is the search for heavy new resonances that decay 
to pairs of top quarks [6, 7]; in the all-hadronic channel, each top quark obtains a large 
Lorentz boost from the heavy resonance decay, yielding a collimated fat jet with 3-prong 
substructure. Since not all top quarks are produced in the boosted regime, one should also 
perform analyses in the resolved regime where the top decay products are well-separated and 
identified as individual jets. Since new (and old) physics could show up at any energy scale, 
it is important to develop robust techniques to handle jets in both resolved and boosted 
kinematics. 
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In a companion paper, we introduced a new jet algorithm called XCone that blurs the 
boundary between resolved and boosted kinematics [1]. The name XCone refers to the fact 
that it is an exclusive cone jet algorithm. Like the exclusive kx algorithm [8], XCone always 
returns a fixed number of jets N. When jets are well-separated, XCone yields nearly conical 
jet regions with radius R. When jets are overlapping, XCone dynamically splits the jet regions 
into nearest neighbor partitions. Thus, XCone smoothly interpolates between isolated conical 
jets and merged jets with substructure, making it ideally suited for studying the boosted and 
quasi-boosted regimes. 

In this paper, we present three applications of XCone which are relevant for LHC physics 
in and beyond the standard model. In section 2, we study high-mass dijet resonances with 
isolated final state jets, showing that XCone has nearly identical performance to the popular 
anti -kx algorithm [9]. In section 3, we study associated Higgs boson production, showing that 
XCone can resolve H —>• bb decays, even when the R bb angle is less than the radius parameter 
R, in contrast to anti -kx- In section 4, we study the classic example of boosted top quarks, 
showing how XCone can simultaneously identify jets and subjets in a high multiplicity final 
state, achieving higher signal efficiency than a traditional fat jet strategy. These three case 
studies highlight the versatility of the XCone jet algorithm across a wide kinematic range and 
motivate the use of XCone as a viable alternative to anti-fey. 

It is worth noting that there have been other attempts to merge the resolved and boosted 
regimes into a single analysis, such as ref. [10] which combines different event topologies into 
a single search. Cone algorithms like SISCone [11] have an overlap parameter that can be 
adjusted to achieve some of the desired jet splitting needed to resolve substructure. More 
recently, the “mass jump” algorithm was introduced to avoid merging separated hard prongs 
[12, 13]. A key novelty of the XCone approach is that no explicit distinction is made between 
jets and subjets in the initial jet finding algorithm. XCone can only partially replace a 
dedicated substructure analysis, especially since it is well-known that a fixed radius R no 
longer performs well in the hyper-boosted regime (see, e.g. [14, 15]). We suspect that it will 
be advantageous to combine XCone jet finding with other jet substructure techniques, though 
we do not pursue that possibility in the present work. 

Note that exclusive clustering has long been part of the jet physics toolbox, though mainly 
in the context of sequential recombination algorithms. Indeed, for reasons of computational 
efficiency, XCone uses for-style clustering internally as part of its jet finding procedure [1]. 
As shown in appendix A, exclusive kx clustering [8] (with an R parameter) also successfully 
interpolates between the resolved and boosted regimes. The key difference is that fc^-style 
jets have irregular boundaries and non-uniform active jet areas [16, 17], while XCone jets are 
conical and uniform. This turns out to give XCone a performance advantage over exclusive 
kx, yielding better mass resolution for boosted Higgs bosons and top quarks. 

Before beginning our case studies, we briefly review the XCone jet algorithm [1], XCone 
is based on minimizing the event shape IV-jettiness [18] using a measure inspired by the jet 
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( 1 . 1 ) 


shape IV-subjettiness [19, 20]. A generic definition of IV-jettiness is 

Tn = ^2 min {PietiPi, m), • • • , Pietipi, tin), Pbea,m(Pi)} , 

i 

where ha = {1, ua} are N light-like axes and pi are the particles in the event. Based on a jet 
measure Pjet(pi,nA) and a beam measure PbeamiPi ), the minimum inside of Tn partitions the 
event into N jet regions and one unclustered beam region. Ideally, one would find the global 
minimum of Tn over all possible axes ha, 


Tn = min Tn, 

ni,n, 2 ,...,n N 


( 1 . 2 ) 


though in practice, one uses iterative procedures to find a local Tn minimum starting from 
infrared and collinear safe seed axes. A variety of IV-jettiness measures have been proposed in 
the literature (most especially in refs. [20, 21], see also [22, 23]), but here we stick exclusively 
to the XCone recommended measure, namely the conical geometric measure with 7 = 1 [1]: 


PjetiPi, ^ A ) — PTi 


( 2 n A • Pi 1 
\riTAPTi R 2 


0/2 


PbeamiPi ) — PTi- 


PTi 



(1.3) 


As recommended in ref. [1], we consider two default values for the parameter /3. The XCone 
default is /3 = 2, which (approximately) aligns the jet axis with the jet momentum, as with 
standard cone algorithms [24]. A recoil-free default option is provided by f3 = 1, where the 
jet axis aligns with the hardest cluster in a jet [20, 25], providing enhanced robustness against 
jet contamination [26]. As described in ref. [1], we use a generalized kx clustering algorithm 
to define seed axes for one-pass Tn minimization. 


2 Dijet Resonances and Comparison to Anti-A>r 

For our first case study, we compare the performance of XCone to antiA’T in the resolved 
regime of well-separated jets. Inclusive jet algorithms like anti-fc^ identify a variable number of 
jets above some pt threshold, which is useful for classifying events into different jet multiplicity 
bins. Exclusive jet algorithms like XCone always return a fixed number of jets N, which is 
useful if the number of desired jets is known in advance. For widely separated cone jets, 
however, the distinction between inclusive and exclusive cone jet algorithms is rather mild, 
since for typical R values, an exclusive cone jet algorithm will just return the N hardest 
jets from an inclusive cone jet algorithm. Since anti -kj- acts like an idealized cone algorithm 
for well-separated jets [9], XCone jets should be quite similar to the hardest N anti -Pt jets. 
When we study overlapping jets in sections 3 and 4, the inclusive/exclusive distinction will 
become much more important. 

A good setting to study the resolved regime is a heavy resonance decay to dijets, where 
the two resulting jets are back-to-back and isolated. Here, we consider the scenario 

pp^Z'^qq, ( 2 . 1 ) 
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Dijet Event with ISR Dijet Event with FSR 




(a) (b) 

Figure 1: Example XCone jet regions found with N = 1, 2, and 3. These are dijet resonance 
events, using R = 0.5 and f5 = 2. (a) For widely separated jets, XCone and anti -kx identify 
nearly identical jet regions, including an additional jet from ISR when N = 3. (b) XCone is 
able to identify jet substructure from FSR, even if the jet regions are closer than ~ R. 

where Z' is a heavy boson with mass mz> and q is a u, d , or s quark. We start with N = 1 
and show that XCone typically matches the hardest anti -h? jet, up to an expected two-fold 
ambiguity when the jets are nearly degenerate in px- Going to JV = 2, both XCone and 
anti -kx can successfully reconstruct the dijet resonance peak. Even at N = 3, the found jets 
are quite similar for typical choices of jet parameters, though XCone will identify final state 
jets with substructure. Overall, XCone has essentially identical performance to anti -kx in 
this basic jet reconstruction scenario for N = 2, but can exhibit different behavior for N = 3 
depending on the event topology. Example XCone jet regions are shown in figure 1. 

In the following study, we use Pythia 8.176 [27, 28] to simulate Z' events at the yfs = 14 
TeV LHC. We take mz> = 1 TeV and assume equal couplings to the three light quarks. All of 
the final-state particles (except neutrinos) with \r)\ < 3.0 are considered for analysis. Anti -hr 
jets are found using FastJet 3.1.2 [29] with standard E-scheme recombination. XCone jets 
are found using Nsubjettiness 2.2.0 as part of FastJet Contrib [30], using the XCone 
default measure with /3 = 2 and /3 = 1. For all algorithms, the jet radius parameter is R = 0.5. 

2.1 N = 1 for Hardest Jet 

For N = 1, the XCone jet will tend to align with the hardest anti -kx jet in the event. The 
reason is that the XCone measure in eq. (1.3) penalizes unclustered px by design. In figure 2a, 
we see that anti-^T and XCone yield nearly identical single jet pr spectra, with the expected 
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Dijet N = 1 


Dijet N = 1 


Dijet N = 1 



Figure 2: Single jet kinematics of N = 1 XCone versus the hardest anti -kx jet, measured 
on the dijet resonance sample. Shown are both the XCone default (/3 = 2) as well as the 
recoil-free variant (/3 = 1). (a) Single jet px spectrum, (b) Jet px difference between XCone 
and ant i-kx jet, showing that anti-fc^ jets are slightly harder on average, (c) Jet px difference 
versus azimuth difference, showing the expected two-fold </> •(->- cj) + n ambiguity for dijets of 
comparable px- Here and in figure 5c below, the sizes of the boxes scale logarithmically with 
the number of entries, with solid blue boxes for (5 = 2 and empty red boxes for [3 = 1. 


Dijet N = 1 



Dijet N = 1 
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(b) 


Figure 3: Same N = 1 comparison as figure 2. (a) Single jet areas, showing the expected 
peak at ttR 2 . (b) Jet area difference, showing that anti -kx jets occasionally have a higher 
area, explaining the px asymmetry seen in figure 2b. 


structure at mz ’/2 from a dijet resonance decay. As shown in figure 2c, there is a two-fold 
4> ambiguity in the found jets, as expected from dijet events where both jets have similar px 
values. Note that the box sizes are logarithmic in bin counts, and in the majority of cases, 
XCone and anti-&T find very similar jet regions. 

In figure 2b, we compare the found jet px on an event-by-event basis, and find a sharp 
peak at A px = p^ ltlkT —p^P one = 0. On a logarithmic scale, one can see a small tail extending 
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Figure 4: Dijet kinematics of N = 2 XCone versus the two hardest anti-Zc^ jets, measured on 
the dijet resonance sample, (a) Dijet mass, showing the expected peak at mz> = 1 TeV. (b) 
Dijet mass difference between XCone and anti-Zc^ jet, showing comparable reconstruction. 


to 0(50 GeV) for (3 = 2, with larger deviations possible in the (3 = 1 case. It is interesting 
that the A px distribution is not symmetric, such that anti-Zcy jets tend to have a larger px 
than XCone jets. In figure 3a, we plot the active jet area [16, 17], 1 which is quite similar 
between the two algorithms and peaked at the expected value of -kR 2 . On an event-by-event 
basis, though, there is a population of anti -kx jets that are systematically larger than XCone 
jets, as shown in figure 3b. This occurs because anti-Zc^ clustering can yield jets that extend 
beyond the conical boundary [9]. The A px asymmetry then arises because these slightly 
bigger anti-Zc^ jets contain more particles. 

Comparing the performance of different (3 values, (3 = 2 jets are more similar to anti-Z.tr 
jets since both methods align the jet axis with the jet momentum. The (3=1 jets are slightly 
softer, since they do not recoil away from the hard jet center to absorb soft radiation. In the 
absence of pileup, however, the (3 = 2 and (3=1 performance is quite similar on single jet 
reconstruction. 2 

2.2 N = 2 for Dijet Reconstruction 

For dijet resonance reconstruction, N = 2 is the most natural choice for running an exclusive 
cone jet algorithm. As shown in figures 4a and 4b, both anti-Zc^ and XCone give a good 
reconstruction of the resonance peak, and they largely agree on the mjj value on an event-by- 

1 We use the built-in FastJet area determination routines using active ghosts. For the general conical 
measure introduced in ref. [1] , the active jet area can be determined analytically, though this is not as straight¬ 
forward for the conical geometric measure used here. 

2 In the presence of pileup, the directions of the /? = 1 axes are more robust to pileup contamination [26]. 
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Dijet N = 3 
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Figure 5: Comparing the third hardest jet between XCone N = 3 and anti-fey, measured on 
the dijet resonance sample, (a) Third jet px spectrum, (b) Third jet px difference between 
XCone and anti-fey, showing that XCone has a somewhat harder spectrum due to its ability 
to identify FSR subjets, (c) Third jet px difference versus azimuth difference, showing a 
population of events where the third jet kinematics are completely different between the 
algorithms. Again, the sizes of the boxes scale logarithmically with the number of entries. 


event basis, without much of an asymmetry in the — m^ one distribution. XCone can 

therefore act as a replacement for anti-fey for dijet resonance reconstruction, with comparable 
performance. 3 

2.3 N = 3 and ISR vs. FSR 

Thus far, XCone and anti-fey have exhibited very similar behavior, but differences start to 
appear when considering N = 3. There are two main ways to achieve three jet configurations: 
either there is sufficient initial state radiation (ISR) to form an additional widely-separated 
jet, or there is sufficient final state radiation (FSR) to give one of the primary jets some 
two-prong substructure. In the ISR case (as in figure la), anti-fey and XCone still give very 
similar results since the jets are non-overlapping. In the FSR case (as in figure lb), XCone 
will often identify two separate prongs inside a fat clover jet, whereas anti-fey can only identify 
FSR if it is further away than R from the hard jet core. 

From the px spectra in figure 5a, we see that the third jet is often softer in the anti-fey 
case, as expected if anti-fey tends to identify ISR jets. XCone, on the other hand, is able to 
find FSR radiation that lies close to one of the two original jets, and thus is more likely to 
find a hard third jet adjacent to the hard dijet structure. This is highlighted in figures 5b 
and 5c, which shows that the third anti-fey jet can have completely uncorrelated kinematics 
from the third XCone jet. We can gain further insight in figure 6a, which shows the distance 

3 It is known that the R = 0.5 cone size is typically too small to capture all of the dijet decay products [31]. 
While we did find that better performance could be obtained with somewhat larger R, we wanted all of the 
plots in this paper to have a common cone size for ease of comparison. 
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Figure 6: Same N = 3 comparison as figure 5. (a) Angle of the third hardest jet to the 
nearest harder jet, showing that XCone jets can be located as close as A R = 0 whereas anti- 
fey jets are forced to have A R > R. (b) Third jet area distribution, showing the expected 
peak at nR 2 , but with larger tails than in the N = 1 case in figure 3a. (c) Third jet area 
difference, showing a population of XCone jets with much smaller areas due to jet splitting. 
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Table 1: Comparing XCone N = 3 to anti-fey. Shown is the fraction of events where the 
n th hardest XCone jet is within R/2 = 0.25 of the n th hardest anti-fey jet. (a) The (3 = 2 
default which behaves most similarly to anti-fey• (b) The (3=1 recoil-free variant where 
larger differences are possible. 


between the third jet and the closest harder jet. In the anti-fey case, the third jet is forced to 
be further away than R from the jet core, whereas in XCone case, the third jet can go nearly 
to A R = 0, corresponding to XCone finding substructure within a fat clover jet, as desired, 
instead of finding a separate ISR jet. 

The same effects are visible in the area distributions in figures 6b and 6c. While the 
overall area distributions are not so dissimilar (particularly in the (3 = 2 case), on an event- 
by-event basis, there is a population of events where the third XCone jet has substantially 
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smaller jet area, indicative of jet overlap. This is the flip side of the area distributions for 
N = 1 in figure 3b, where anti -hr jets could grow larger in size by incorporating a neighboring 
subjet. In the XCone case, that subjet is separately identified as its own jet for N = 3. 

Despite these differences, the overall jet reconstruction is still rather similar between 
XCone and anti-fcr- In tables la and lb, we show the fraction of events for which the n th 
hardest XCone jet is within R/2 = 0.25 of the n th hardest anti-A;? 1 jet. For (3 = 2, the three 
hardest jets are well aligned 80% to 90% of the time. For (3 = 1, there are larger deviations, 
though often this is just because the first and second jets are reversed in pt ordering. We 
conclude that the use of XCone is particularly advantageous for tagging small-angle FSR, but 
otherwise will have similar performance to anti -kx- 

Of course, there may be physics contexts where splitting jets by nearest-neighbor is 
undesirable and circular jet regions are preferred. After all, the hardest anti-fc^ jet in an event 
tends to be circular, whereas proximate softer jets form crescent shaped regions, and this is 
often a desirable feature for jet calibrations and calculations. In this context, note that XCone 
jet regions are fully determined by the locations of the corresponding jet axes, independent 
of the details of the jet constituents. Therefore, it is straightforward to test for jet splitting 
by simply checking whether any two jet axes are closer than 2 R. One could even imagine 
running XCone in a mode where N is subsequently decreased until the distances between 
all axis pairs are larger than 2 R, forcing circular jet regions. As we will see below, though, 
it is precisely the ability of XCone to split abutting jets which allows it to handle extreme 
kinematic circumstances where anti -hr reconstruction inevitability leads to jet merging. 

3 Boosted Higgs Bosons and Intelligent Jet Splitting 

To highlight the distinct advantages of XCone, we now consider physics situations where 
resolving substructure is a key element of the analysis. Because XCone always identifies N 
jets, it is well-suited to physics applications with a fixed number of expected (sub)jets. This is 
particularly interesting for cases involving jet substructure, where traditional jet algorithms 
yield merged fat jets, but XCone can identify jets and subjets simultaneously. 

As a well-motivated example at the LHC, consider associated Higgs boson production 
where the Higgs decays to bottom pairs [32, 33]: 

pp —> HZ —> bbvv. (3.1) 

Apart from possible ISR, the final state consists only of two 6-jets. To fight QCD backgrounds, 
the pt of the Higgs boson must be reasonably large [34], However, in this regime, the Higgs 
decay products are more collimated, often resulting in jet merging. Roughly speaking, the 
two 6-jets will be merged into a single fat jet when the Higgs boson is at the scale 

merge _ ( 3 . 2 ) 

R 

In order to counteract this effect, either R can be decreased until it is small enough to resolve 
two separate 6-jets, or jet substructure techniques can be used [35]. 
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Figure 7: Example XCone jet regions for the Higgs analysis for px ~ {200,500,800} GeV, 
using R = 0.5. Top row: A boosted Higgs analysis with N = 1, where XCone and anti -kx 
identify nearly the same jet regions. Bottom row: A resolved Higgs analysis with N = 2, 
where XCone separately identifies the H —>• 66 substructure while anti-fc^ often identifies 
ISR. For each px value, the same event appears for N = 1 and N = 2. 


This (quasi-)boosted Higgs analysis is well-suited for XCone. At minimum, N = 1 
can identify a single fat jet, after which existing jet substructure techniques can be applied. 
Though we will not perform a detailed jet substructure analysis here, we will show that XCone 
with N = 1 has nearly the same signal efficiency as anti-A^ as a function of px, and therefore 
can be used as a suitable starting point for a full boosted Higgs analysis. 

More intuitively, N = 2 can be used to identity the two hard 6-jets in the event at all px 
scales. This N = 2 strategy is very similar to what is already being done in existing ATLAS 
and CMS studies [32, 33], where anti-A’T is used to resolve two separate 6-jets. For anti-fc^; 
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the 6-jets merge at high enough px , so this resolved technique is no longer efficient (see [36] 
for a recent discussion). For XCone, we show that the corresponding N = 2 resolved strategy 
can be pushed deep into the high px regime while maintaining good signal efficiency. For this 
reason, we advocate XCone as a promising approach to extrapolate resolved analyses into the 
boosted regime. 

Example event displays using the N = 1 and N = 2 methods are shown in figure 7. In the 
text, we restrict our comparisons to anti-fey, though in appendix A.l, we also show results for 
exclusive fey, which has performance comparable to XCone, albeit with irregular jet bound¬ 
aries. A full accounting of background processes is beyond the scope of this work, though we 
do perform a sanity check in appendix B to verify that XCone does not unnecessarily sculpt 
the Z +jets background. Since the relative performance of the boosted and resolved strategies 
depends on the details of the background, we postpone a direct comparison of IV = 1 with 
N = 2 to future work. 

Like the previous dijet study, we use Pythia 8.176 [27, 28] at the y/s = 14 TeV LHC to 
simulate pp —>• HZ. For simplicity, we force the decays Z —>• vv and H —>• 66. Like in the 
previous study, all of the final-state particles (except neutrinos) with |? 7 | < 3.0 are considered 
for analysis. In order to analyze the properties of the algorithm in different px regimes, we 
place generator-level px cuts on the Higgs boson between 200 and 1000 GeV. We use the 
same R = 0.5 jet radius for all analyses in this section, such that p“ crge ~ 500 GeV is in the 
middle of our studied px range. 

3.1 N = 1 for Boosted Analysis 

Since the pioneering work in ref. [35], the boosted Higgs channel has often been been analyzed 
by finding one fat jet with a large radius parameter, and then using substructure techniques 
to analyze its properties (see e.g. [37]). Here, we compare N = 1 XCone to anti-fey to show 
that they give similar behavior in the boosted regime, though we stick to a relatively small 
R = 0.5. 4 

Unlike in the dijet resonance study, the difference between (3 = 1 and /3 = 2 is more 
noticeable for quasi-boosted Higgs bosons. As described in refs. [20, 25], (3 = 1 minimization 
aligns the jet axis with the hardest cluster within a jet, whereas j3 = 2 minimization places 
the jet axis approximately in the direction of the jet momentum. Roughly speaking, /3 = 1 
finds the “median” jet axis direction whereas (3 = 2 finds the “mean” jet axis direction. For 
the boosted Higgs case, the (3 = 1 jet is more likely to point in the direction of one of the two 
6-jets, while the (3 = 2 jet is more likely to lie in between the two 6-jets and track the Higgs 
momentum direction. Anti-fey (with standard E-scheme recombination [41]) acts like (3 = 2 
since it also aligns the jet axis with the jet momentum. 

In the top row of figure 8, we show the single jet invariant mass as the minimum Higgs 
px (at generator level) is adjusted from 200 GeV to 500 GeV to 800 GeV. With increasing 

4 The original BDRS paper used the Cambridge/Aachen algorithm [38-40] to identify this fat jet. To avoid 
a proliferation of curves, we only compare XCone to anti -kr in our analysis. Regardless of the fat jet starting 
point, one can still recluster with Cambridge/Aachen to apply the BDRS mass drop criteria. 
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Figure 8: Top row: Comparing boosted Higgs reconstruction between XCone with N = 1 
and the hardest anti-fey jet, using R = 0.5. As the Higgs px increases from (a) 200 GeV 
to (b) 500 GeV to (c) 800 GeV, both methods capture the merged Higgs decay products, 
yielding a growing mass peak at run = 125 GeV. Bottom row: Comparing Higgs jet area 
using XCone N = 1 and the hardest anti-Vr jet. All distributions show the large expected 
peak at A = 7t(0.5) 2 . 


Pt, more of the Higgs decay products are contained inside a single jet and the peak at 
rrij = 125 GeV grows. By eye, the anti -kx and (3 = 2 distributions are quite similar, whereas 
the [3=1 case has a somewhat worse performance since the jet axis is misaligned from the 
Higgs boson momentum. Though not shown in the plot, the default conical geometric XCone 
measure [1] yields a slightly better Higgs peak than the original conical measure [20]. In the 
bottom row of figure 8, we show the distribution of active jet areas, where both algorithms 
have a peak at ttR 2 , though anti -kx has a slight high-side tail when px — p^ clgc . 

We quantify the Higgs reconstruction efficiency in figure 9, which shows the fraction of 
jets in the Higgs mass window rrij € [100,150] GeV. At very high px values, the algorithms 
have very similar performance, but [3 = 2 does better in the vicinity of p™ cigc . This is because 
the [3 = 2 jet axis is more likely to lie in between the two 6-jets, so the jet is more likely to 
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Figure 9: Efficiency for N = 1 boosted Higgs reconstruction as a function of Higgs pr , 
with the mass window mj € [100,150] GeV. The efficiency grows when the pr is above the 
merging scale 2mn/R- XCone [3 = 2 outperforms (3 = 1 in the transition region since the 
former centers the jet along the Higgs momentum. 

capture the full Higgs decay products. As expected, anti -hr and [3 = 2 are very similar. 

3.2 N = 2 for Resolved Analysis 

For this Higgs production scenario, the real power of XCone comes from using N = 2. In 
the unboosted regime, the standard analysis strategy is to find two fo-jets, reconstruct their 
invariant mass, and look for a peak at the known Higgs mass [32, 33]. In the boosted regime 
with jet merging, though, algorithms like anti-fey are likely to find one fat Higgs jet and one 
ISR jet elsewhere in the event, so a dijet reconstruction strategy is no longer effective. By 
contrast, since XCone is an exclusive cone algorithm, it will always identify two jets regardless 
of the Higgs pr ■ To find boosted Higgs bosons with XCone, we can simply run with N = 2 
and perform a standard resolved jet analysis. 

In the top row of figure 10, we show the reconstructed dijet invariant mass comparing 
XCone with anti-fey. For low Higgs py, all algorithms find the Higgs peak with roughly 
the same line shape. 5 As the Higgs pr increases, the anti-fey distributions move to higher 
dijet masses because of a merged Higgs jet being paired with an ISR jet, whereas XCone 
maintains good performance regardless of py. In the bottom row of figure 10, we show the 
jet area distributions. Anti-fey jets peak at itR 2 regardless of the Higgs pr, whereas XCone 
jets transition from nR 2 at low pr to roughly half that at high py, indicative of the desired 
split jet regions. 

We show the Higgs reconstruction efficiency in figure 11a as a function of the Higgs py. 
Anti-fey jets start to merge around pr = 300 GeV and the Higgs efficiency drops significantly. 

5 The low mass tail in each of the plots can be explained by neutrinos from B meson decays within the b-jet. 
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Figure 10: Same as figure 8 but for N = 2. As the Higgs px increases from (a) 200 GeV to 
(b) 500 GeV to (c) 800 GeV, anti -kx suffers from jet merging, whereas XCone yields a dijet 
Higgs peak across the px spectrum. At low px, both distributions have the expected area 
peak at A = 7r(0.5) 2 . As px increases, the XCone area falls to roughly half its original value, 
indicative of overlapping jets. See figure 18 for a comparison to exclusive kx- 


XCone has nearly flat efficiency as a function of Higgs px, even as the px crosses beyond the 
^mcrge gca j e . ^t higher px values, the (3 = 2 jets see a performance degradation, since the 
(3 = 2 jets are more influenced by ISR at wide angles. The (3 = 1 jets are able to maintain 
their performance since the jet axes tend to always align with the momentum of the Higgs 
decay products. Overall, the XCone reconstruction efficiency is around 65% for R = 0.5. 

3.3 N = 3 for ISR Vetoing 

To improve the XCone performance, we have to account for ISR, which is the leading cause 
of misreconstruction. In the presence of hard ISR, XCone can identify an ISR jet instead of 
finding one of the two b jets. To address this issue, we can explicitly identify the ISR jet using 
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Higgs Efficiency for N = 2 



(a) 


Higgs Efficiency for N = 3 + ISR Veto 



(b) 


Figure 11: (a) Efficiency for N = 2 resolved dijet Higgs reconstruction as a function of Higgs 
Pt , with the mass window m,j € [100,150] GeV. We see a degradation in the efficiency of the 
anti-fey spectrum at higher px due to jet merging, while XCone produces constant efficiency 
across the spectrum at around 65%. Here, (3 = 1 outperforms (3 = 2, since the former is less 
susceptible to wide-angle jet contamination, (b) Same as figure 11a, but now allowing the 
Higgs to be reconstructed with either N = 2 or N = 3, using the minimum pairwise mass to 
veto ISR. Now, XCone (3 = 1 and (3 = 2 have comparable performance at around 75%. See 
figure 19 for a comparison to exclusive fey. 


N = 3 and find the best reconstruction among the N = 2 and N = 3 options. 6 

We first run XCone with N = 2 and check whether the dijet mass is in the nrijj € 
[100,150] GeV window. If not, we run XCone with N = 3 and apply the Higgs mass test on 
the pair of jets with the smallest invariant mass, as these are kinematically the most likely 
candidates to be the Higgs decay products. Allowing two pathways for Higgs reconstruction 
gives improved signal efficiency, and in figure lib, we see that both (3 = 1 and (3 = 2 now have 
efficiencies around 75%. Applying the same 2- and 3-jet technique to anti-fey does improve 
the signal efficiency somewhat, though XCone still has better performance for px > 300 GeV. 

We conclude that XCone is highly efficient in reconstructing Higgs bosons across a range 
of kinematics, from the resolved to quasi-boosted to boosted regimes. Comparing figure 9 with 
figure 11, we see that N = 1 does yield better signal efficiency at very high boosts, whereas 
N = 2,3 yields uniform (and still quite good) performance. Ultimately, one may want to 
combine all three methods, although this would require a full understanding of background 
processes (see discussion in appendix B). In a full analysis, one would also want to exploit 
6-tagging to better identify the Higgs candidate and mitigate non-6 backgrounds. This is 
especially important when using the combined N = 2,3 method, where background events 

6 An alternative approach is to use a modified beam measure (such as the 7 = 2 option discussed in ref. [1]) 
to preferentially select central jets. 
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(a) (b) 


Figure 12: Two reconstructed events from the BOOST 2010 top sample, using XCone with 
N = 6 and R = 0.5. As is often the case, N = 6 and N = 2x3 give identical results for these 
events. Compared to anti -kx, XCone directly identifies three prong substructure through the 
initial jet finding. While [3 = 2 and /3 = 1 often give similar jet regions, they can differ more 
substantially, as shown on the right. 


have two pathways to land in the signal window. But the main take away from this study 
is that XCone allows traditional resolved analyses to be extended into the boosted regime, 
providing a p^-independent method for Higgs reconstruction. 

4 Boosted Top Quarks and High-Multiplicity Final States 

Given the success of XCone in reconstructing boosted Higgs bosons, we now test whether 
XCone can handle the increasingly complex final states possible at LHC collision energies. 
An important process at the LHC is pair production of top quarks with fully hadronic decays: 

pp —»• tt, t^-Wb^-qqb. (4.1) 

At low rritt, the final state consists of six resolved jets. At high rn t i, the jets are arranged into 
two fat jets with three-prong substructure, and a variety of substructure techniques have been 
developed to tag these boosted tops (see, e.g. [42-45]). Here, we show that XCone with N = 6 
can identify each of the six individual (sub)jets, regardless of the rn t j value, allowing the same 
analysis strategy to be effective in both the resolved and boosted regimes (see appendix A.2 
for similar behavior from exclusive fey). We also show a more efficient N = 2x3 method 
where the event is first partitioned into hemispheres using N = 2 and then separated into 
sub jets by applying N = 3 in each hemisphere. 
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Boosted Top with ISR 


Four Leaf Boosted Top 




Figure 13: Pathological N = 6 reconstructions that are improved by using IV = 2x3. For 
these hand selected events, N = 2x3 correctly identifies the boosted tops, while N = 6 
identifies (a) an additional hard jet from ISR and (b) a fourth leaf in one of the boosted tops. 
Not shown are examples where the N = 2x3 method finds an ISR jet, which can sometimes 
be resolved using A r = 2x4orA r = 7. 


Our study is based on the BOOST 2010 events samples [2], which were generated for the 
7 TeV LHC.' For the boosted top signal, we use the Herwig tt —> hadrons samples where the 
generator-level top px ranges from 200-800 GeV in bins of 100 GeV. We also apply XCone to 
the Herwig dijet background sample in the same pt bins. As in the boosted Higgs study, we 
take R = 0.5. When comparing to traditional fat jet studies, we use anti-kr jets with R = 1.0 
as recommended in the BOOST 2010 report [2]. For brevity, we do not include a straight 
N = 2 fat jet study for XCone, since the results are similar to those found in section 2. 
Example event displays from XCone are shown in figures 12 and 13. 

At the outset, we want to emphasize that XCone is able to handle partially overlapping 
jets, as expected in the quasi-boosted regime. In the highly boosted limit, however, the 
sub jets are fully overlapping, so substructure methods based on fat jets are typically more 
effective at signal/background separation. While it is possible to combine XCone with jet 
shapes like A r -subjettiness [19, 20] for improved performance in the highly boosted limit, 
we find in preliminary studies that there is no real advantage to using N = 6 over a more 
traditional fat jet analysis with N = 2. The key advantage of XCone is that it yields relatively 
uniform performance over a broad px range, and while specialized techniques can achieve 

7 While the BOOST 2011 report [3] included updated benchmark samples, those event files were lost due 
to “spring cleaning” at the host servers. Instead of generating fresh boosted top samples at 14 TeV collision 
energies, we have decided to use the BOOST 2010 event samples to enable easier comparisons and verifications 
of our results. 
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better performance at extreme kinematics, XCone allows resolved techniques to be applied 
even when jets are overlapping. 

4.1 N = 6 Baseline Analysis 

The most straightforward application of XCone for hadronic tops is using N = 6 to resolve 
six jets. Like in section 3.3, one can try to improve the performance by explicitly identifying 
ISR jets using N = 7, but we find that the N = 2x3 method shown below is more effective 
at dealing with the combinatorial complexity of this final state. For these studies, we have 
not incorporated 6-tagging information, though it would be straightforward to use XCone in 
conjunction with recent subjet 6-tagging methods [46, 47]. 

After running XCone with N = 6, we want to partition the jets into two top candidates in 
a way that is p^-independent. We do this by finding all = 20 ways of partitioning the jets 
into two three-jet clusters, and then finding the configuration that minimizes the scalar sum 
of the three-jet masses. Much like in the boosted Higgs case, we expect that minimizing the 
mass is most likely to yield the correct top candidates. For an apples-to-apples comparison, 
we apply the same analysis strategy on the six hardest anti -kx jets. In a more sophisticated 
analysis, one could use a ^-minimization approach to find the best top candidates, also 
incorporating IF-mass and 6-tagging information. 

In figure 14a, we show the reconstructed top jet mass in the px € [400,500] GeV bin, 
comparing XCone and anti -kx with N = 6. The XCone distributions show a better resolved 
top peak, which is more symmetric around the top mass and has a substantially reduced 
high-mass tail. In the area distributions in figure 14b, we see that XCone jets are peaked at 
roughly (2/3)7 tR 2 , where the factor of 2/3 is expected since the jets are arranged in a clover 
configuration around the boosted top direction. The anti -kx jets are peaked around irR 2 , as 
expected since anti-fey jets do not typically overlap. Because of subjet mergers, the six found 
anti -kx jets are not all associated with the top quarks, leading to large invariant mass values 
from ISR jets. 

The equivalent dijet background distributions are shown in figures 14c and 14d. In the 
absence of genuine three-prong substructure, the XCone jets tend to be scattered throughout 
the event, leading to large reconstructed invariant masses and nR 2 areas. The effect is 
even stronger in the anti-fey jets, since they avoid overlaps. There is a noticeable difference 
between the (3 = 1 and (3 = 2 QCD dijet distributions, indicating that the mean versus median 
effects described in section 3.1 are significant for quark and gluon jets without well-defined 
substructure (a feature exploited in ref. [48]). 

To define the top signal region, we take a top mass window of rrijjj € [150,200] GeV. 8 We 
also apply a VF-tagging cut as described in the CMS analysis [49], by analyzing each pairwise 
combination of the three subjects and requiring a minimum pairwise invariant mass cut of 
rrijj t min > 50 GeV. In figures 15a and 15b we show the efficiency and mistag rates for the 

8 Compared to the typical mass ranges used in the BOOST 2010 report [2], this range is smaller and more 
symmetric. Because the peak from XCone is more narrow and symmetric around the top mass, we can use 
this tighter mass window without much loss in signal efficiency. 
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Top N = 6 (p g [400,500] GeV) 



QCD N = 6 (p g [400,500] GeV) QCD N = 6 (p e [400,500] GeV) 




Figure 14: Top row: Comparing resolved three-jet top reconstruction between XCone with 
N = 6 and the six hardest anti-^T jets, in the pt £ [400,500] GeV bin. Here, top candi¬ 
dates are identified by minimizing the sum of the three-jet masses, (a) Candidate top mass 
distributions, showing that XCone does not have as pronounced of a high mass tail due to 
ISR. (b) Area of all six jets, showing a peak at (2/3)7 xR? for XCone expected of clover jet 
configuration compared to a peak at txR 2 for anti -hr expected of separated jets. Bottom row: 
Same for the QCD background. See figure 20 for a comparison to exclusive hr. 
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Top Efficiency for N = 6 QCD Mistag for N = 6 Signal Significance for N = 6 



(a) (b) (c) 


Figure 15: Comparing boosted top and XCone performance with N = 6 as a function of 
top pt for (a) signal efficiency, (b) background mistag, and (c) signal significance gain. By 
avoiding jet mergers, XCone has improved performance compared to anti -kx across the entire 
Pt range. See figure 21 for a comparison to exclusive kT- 


top and dijet samples as a function of (generator-level) pr- XCone has a signal efficiency of 
around 60% across the entire pt range, showing the desired scale invariance. While anti-Atp 
starts with the same 60% efficiency in the resolved regime, the efficiency drops considerably 
with pt due to jet merging, analogous to what was found in section 3.2. Both methods have 
around a 10% background mistag rate, which is relatively stable as a function of pt- 

The improvement in signal significance (S/yfB) is shown in figure 15c, where we see 
that the performance remains relatively flat across the entire pt range, with performance 
comparable to or better than anti -kT- Note that we have not included 6-tagging information 
nor additional jet shape information, so background rejection factors can be much larger 
in practice. From this study, we see that a simple application of XCone allows for a pr- 
independent analysis strategy even for complicated final states. 

4.2 N = 2 x 3 Improved Analysis 

To further improve on the performance of XCone, we can take into account the event topology. 
Even with a moderate boost, the top decay products tend to arrange themselves into two 
hemispheres, a feature that is exploited, for example, in the HEPTogTagger [50] (see also 
[51]). Thus, we can use XCone in multiple stages, first dividing the event into separate top 
candidate regions with N = 2 and R —>• oo, and then finding jets in each of those regions 
using N = 3 and R = 0.5. 9 

There are two advantages of this N = 2x3 approach over the N = 6 approach. First, 
it reduces combinatorial confusion and increases computational efficiency. Second, it ensures 
that each top candidate has the potential to involve three jets. Even without ISR (as in 

9 For a theoretical analysis of the top mass using a related hemisphere approach, see refs. [52, 53]. 
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Top N = 2x3 (p e [400,500] GeV) QCD N = 2x3 (p x e [400,500] GeV) 




Figure 16: Reconstructed mass distributions with the N = 2 x 3 strategy, for (a) top signal 
events and (b) QCD background events. Here we compare the XCone N = 2x3 method to two 
traditional methods: a boosted strategy (“Bst”) where two anti -fey R = 1.0 jets have three 
exclusive fey subjets, and a resolved strategy (“Res”) where two exclusive hr hemispheres 
have three anti -fey R = 0.5 jets. 

figure 13a), the N = 6 method can yield one four-leaf top clover and one two-leaf top clover 
(as in figure 13b), something that is avoided with N = 2 x 3. While it is possible to get 
even higher signal efficiencies by applying N = 2 x 4 and vetoing ISR jets (analogous to 
section 3.3), such an approach tends to also increase the background rnistag rate, so we will 
not show N = 2 x 4 results here. 

We can compare XCone to traditional top reconstruction methods in two different ways. 
For a traditional boosted strategy (“Bst”), we can run anti-fey with R = 1.0 to find two fat 
jets and then run exclusive fey with N = 3 and R = 0.5 on the fat jet constituents to identify 
subjets. For a traditional resolved strategy (“Res”), we can run exclusive fey with N = 2 
and R —>• oo to find hemisphere regions, and then run anti-fey with R = 0.5 to find the three 
hardest jets in each hemisphere. As we will see, XCone N = 2 x 3 effectively interpolates 
between these behaviors as a function of py, reproducing (and sometimes surpassing) the 
best performance of the traditional strategies in their respective domains. To highlight the 
advantages of XCone, we will be working in the regime of high signal acceptance; in the 
regime of high background rejection, it is well known that anti-fey-based boosted strategies 
are highly effective when combined with jet substructure methods. 

The resulting top mass distributions are shown in figure 16a, again in the py £ [400, 500] 
GeV bin. The top mass distribution for XCone N = 2x3 jets is similar to the N = 6 
case, continuing to maintain a peaked, symmetric shape around the top mass. Crucially, 
N = 2x3 reduces the high-mass tail since there are more correctly reconstructed top quarks. 
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Top Efficiency for N = 2x3 QCD Mistag for N = 2x3 
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Figure 17: Comparing boosted top and XCone performance with N = 2 x 3 as a function 
of top pt for (a) signal efficiency, (b) background mistag, and (c) signal significance gain. 
XCone interpolates between the traditional resolved strategy at low px and the traditional 
boosted strategy at high px, with improved performance in the boosted regime due to the 
addition of soft subjets (at moderate px) and grooming away of jet contamination (at higher 
Pt)- 

The traditional boosted strategy results in a W shelf caused by the R = 1.0 jet radius not 
containing the full top decay products, while the traditional resolved strategy has a high- 
side mass tail from the inclusion of ISR jets. XCone avoids both of these pitfalls, giving an 
excellent overall reconstruction. 

Similarly, as shown in figure 16b, the background mass distribution for XCone falls in 
between the traditional boosted and resolved strategies. Like in the N = 6 case, both 
N = 2 x 3 XCone jets and traditional resolved jets tend to be scattered throughout the 
hemisphere, leading to large invariant masses. However, the additional hemisphere constraint 
from using N = 2x3 helps to control this effect, giving smaller masses on average for N = 2x3 
than N = 6. 

The signal efficiency and background mistag rates are given in figures 17a and 17b, again 
for the rrijjj € [150,200] GeV top mass window and rrijj )mm > 50 GeV W mass cut. For 
the signal efficiency, it is clear the XCone interpolates between the good traditional resolved 
performance at low px and the good traditional boosted performance at high px, yielding 
approximately a 60% reconstruction efficiency throughout the px range. For the background 
mistag rate, XCone holds steady at 10%, where again, further improvements are possible 
using 5-tagging or substructure information. 

The improvement in signal significance ( S/y/B ) is shown in figure 17c, where across 
the whole px range, the performance of XCone matches or surpasses that of the traditional 
boosted and resolved strategies. In the boosted regime, the improvement of XCone over 
traditional methods is due to two different effects already alluded to above. At moderate 
px, XCone can identify soft subjets that fall outside of the normal anti-fey cluster radius 
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(see also ref. [54]), increasing the jet mass into the top window. At high pr, R = 0.5 acts 
similarly to the subjet radius in filtering [35] or trimming [55], removing jet contamination 
and decreasing the jet mass into the top window. We conclude that XCone gives a powerful 
way to extend conventional resolved analysis strategies into the boosted regime, especially if 
the goal is obtaining high signal efficiency. 

5 Conclusions 

In this paper, we presented three case studies to show how XCone can be used in a range 
of LHC analyses, producing comparable or better results than conventional methods. Re¬ 
markably, a single benchmark cone size of R = 0.5 was able to successfully reconstruct dijet 
resonances, boosted Higgs bosons, and boosted top quarks. We are particularly encouraged 
by XCone’s ability to smoothly interpolate between the resolved and boosted regimes, and 
we anticipate that further improvement will be possible by combining XCone jet finding with 
additional jet substructure discriminants. 

The focus of this paper has been on the physics applications of XCone, but it is also im¬ 
portant to note the relative computational efficiency of XCone compared to other algorithms. 
In the case of boosted tops with IV = 6, we find that the anti -kx algorithm runs at an average 
speed of 0.3 ms per event on a typical laptop, while XCone takes around 7.3 ms per event, 
which is roughly 25 times slower, though still relatively fast. If computational speed is a 
priority, one can use XCone with the seed axes directly as the jet axes (i.e. no minimization 
step), which is dubbed “PseudoXCone” in the Nsubjettiness contrib. This takes around 
0.7 ms per event, only 2.3 times slower than anti-Zc^. As shown in the companion paper, 95% 
of the seed axes are located at or near the global IV-jettiness minimum [1], so the effectiveness 
of PseudoXCone is comparable to that of XCone. As an example, there are only percent-level 
differences in the boosted top signal significance by using PseudoXCone instead of XCone. 

Beyond the examples shown in this paper, XCone should also work well in even more 
complex final states, including multi-jet searches for physics beyond the standard model. 
A key challenge for any exclusive approach is how to best veto ISR jets, and it may be 
that multiple N values will be needed reach optimal performance. One intriguing possibility 
for new physics searches is consider different IV-jettiness beam measures. In particular, the 
conical geometric measure in ref. [1] has a 7 = 2 option (compared to the recommended 7 = 1 
default), which preferentially select jets in the central region. This would help not only to 
veto ISR jets but also to control QCD backgrounds. 

In an experimental context, a key practical question is how to calibrate the energy scale 
of XCone jets. For widely separated jets, we have seen that XCone jets are essentially the 
same as anti-fcr jets, so existing calibration strategies should be straightforward to adapt to 
XCone. The case of overlapping jets is more complicated in XCone, since jet regions can 
have varying shapes and sizes. As already mentioned, though, XCone jet regions are fully 
determined by the location of the jet axes, independent of the details of the jet constituents, 
so one could develop a calibration strategy that accounts for, say, the active jet area [16, 17] 
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when determining the jet energy scale. Indeed, such a strategy is already used for anti -kx 
jets in the context of pileup mitigation, though further studies are needed to understand 
systematics in the XCone case. 

Ultimately, the choice of jet algorithm should depend on the physics process of interest. 
For situations with widely separated jets, exclusive cone algorithms like XCone and inclusive 
cone algorithms like anti -kx yield rather similar performance, so there is no obvious reason 
to prefer one algorithm over the other. XCone does have the appealing features of yielding 
well-defined rigid jet geometries and fitting nicely into IV-jettiness factorization theorems 
(see [1]), though this has to be balanced against the ubiquity, simplicity, and computational 
efficiency of anti-/ftp. For this reason, the advantages of XCone are most prominent in the 
boosted regime of overlapping (sub)jets, where standard anti -kx reconstruction is simply not 
applicable while XCone can dynamically split jet regions. 

We have emphasized how XCone allows analysis strategies developed in the more familiar 
resolved regime to be extrapolated into the boosted regime. One could also adopt the reverse 
strategy of taking analysis strategies based on boosted jets and extrapolating them into the 
resolved regime. The HEPTogTagger [50] is an example of this reverse strategy, since it uses 
fat jets to achieve good signal efficiency at low px- The variable R jet algorithm [56] can also 
be used to implement a reverse strategy, since it can match the jet radius of a fat jet to the 
Pt of the boosted object of interest. A priori, it is not clear whether resolved-to-boosted or 
boosted-to-resolved approaches will be more performant, so it is important to develop both 
types of strategies, as well as hybrid strategies like our N = 2 x 3 method. 10 

Given the value of exclusive jet algorithms for lepton colliders but the advantages of 
conical jets for hadron colliders, we expect that exclusive cone jet algorithms will find useful 
applications beyond the ones studied in this paper. Because XCone effectively separates 
jet axis finding from jet region finding, one could even imagine a generalized exclusive cone 
strategy that dynamically chooses different jet radii R depending on the final state, making 
it possible to treat even more extreme kinematics. We look forward to seeing how XCone and 
its generalizations perform in future analyses, as the LHC continues to pursue physics in and 
beyond the standard model over a wide kinematic range. 
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A Comparison to Exclusive fey 

The focus of this paper has been on comparing XCone to anti-fey, which is essentially com¬ 
paring an exclusive cone algorithm to an inclusive cone algorithm. It is also instructive to 
compare XCone to other exclusive non-cone algorithms, which we do in this appendix. Given 
the popular use of sequential recombination algorithms, the exclusive variants of fey [8, 58] 
and Cambridge/Aachen (C/A) [38-40] clustering provide the most useful comparisons. 

Because C/A clusters according to angles alone, it turns out to yield rather poor perfor¬ 
mance when dealing with events with multiple angular scales. This can also be understood 
from the study in ref. [1], which shows that C/A does not provide good seed axes for N- 
jettiness minimization. For this reason, we only compare to the exclusive fey algorithm, as it 
provides the most similar performance to XCone. Indeed, the 0 = 1 minimization procedure 
in XCone starts from seeds derived from kx clustering. 11 

As explained in section 2, the distinction between exclusive cone algorithms and inclusive 
cone algorithms is largely irrelevant in the case of well-separated jets. By contrast, the 
distinction between cone jets and kx- style jets is rather important for well-separated jets, 
especially at large jet radius. Famously, fey-style jets have distinctly non-conical boundaries 
which are determined by the configuration of soft radiation within the jet, yielding a broad 
spectrum of jet areas [16, 17]. For this reason, it is not really fair to compare XCone to 
exclusive fey with well-separated jets, since XCone simply inherits the advantages of anti-fey 
and other cone-like algorithms. 

The comparison between XCone and exclusive fey is most instructive in the boosted 
regime, where area effects are less important and the distinction between inclusive and ex¬ 
clusive jets is more pronounced. We restrict our attention to N = 2 and N = 3 boosted 
Higgs reconstruction (see sections 3.2 and 3.3) and N = 6 boosted top reconstruction (see 
section 4.1). We do not show N = 2 x 3 for boosted tops, since section 4.2 already presented 
a similar study with exclusive fey. 

A.l Boosted Higgs Bosons with N = 2 

We saw that XCone with N = 2 can reconstruct H —>• bb decays in both the resolved and 
boosted regimes, and N = 3 can be used to improve performance by accounting for possible 
ISR in the event. Here, we can repeat the exact same analysis strategy using the exclusive 
fey algorithm, with the same choice of R = 0.5 and N = 2,3. The XCone distributions shown 
here are identical to the ones shown in section 3. 

n For /3 = 1 minimization, XCone uses the winner-take-all recombination scheme [25, 59, 60]. For the 
exclusive kx studies here, we stick with standard .E-scheme recombination. For /? = 2 minimization, XCone 
uses a generalized kx measure halfway between kx and C/A. 
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Higgs N = 2 (p > 500 GeV) 
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Higgs N = 2 (p x > 800 GeV) 
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Figure 18: Same as figure 10, but with exclusive kx- As the Higgs px increases from (a) 
200 GeV to (b) 500 GeV to (c) 800 GeV, both XCone and exclusive kx yield a dijet Higgs 
peak. Exclusive kx shows a larger low-mass tail, though, indicating that it does not always 
capture all of the Higgs decay products. This is due to the irregular shape of the kx jets, as 
quantified by the jet areas shown in the bottom row. 


The top row of figure 18 shows that the reconstructed Higgs mass distributions are rather 
similar between the two algorithms, indicating that exclusive kx also successfully finds the 
Higgs decay products across a wide kinematic range. However, exclusive kx has a noticeably 
larger low-mass tail, indicating that the algorithm does not always capture all of the Higgs 
decay products. This is due to the irregularity of the resulting kx jet shapes. As shown in 
the bottom row of figure 18, the exclusive kx jet area distributions are rather broad, though 
they roughly peak at ttR 2 as expected. Small-area jets give rise to the low-mass tail while 
large-area jets are responsible for the high-mass tail. By keeping a more uniform jet area 
distribution, XCone achieves better Higgs mass resolution. 

The Higgs efficiency of the two algorithms is shown in figure 19, using both the N = 2 and 
N = 3 methods. Both XCone and exclusive kx follow a similar pattern, displaying roughly 
constant efficiencies across the entire px range with improved performance going into the 
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Figure 19: Same as figure 11, but with exclusive kx- The efficiency of both XCone and 
exclusive kx remain roughly constant across the entire kinematic range. XCone shows a dis¬ 
tinct advantage over exclusive kx, however, as its more regular shape allows it to consistently 
capture the Higgs decay products. 

boosted regime. XCone yields a higher overall efficiency, though, due to the jet shape issue 
described above. We conclude that the conical nature of XCone gives it an advantage over 
exclusive kx for boosted Higgs reconstruction. 

A.2 Boosted Top Quarks with N = 6 

The comparison between exclusive kx and XCone is also instructive for high multiplicity final 
states, such as boosted top reconstruction. We apply the same analysis strategy as described 
in section 4.1, comparing exclusive kx to XCone with R = 0.5 and N = 6. 

The mass distributions for exclusive kx jets are shown in the left column of figure 20. For 
the boosted top signal, the top mass peak is very similar to that of XCone, with a somewhat 
less pronounced peak and an offset to higher mass values. This same high-mass shift is seen 
in the QCD background distributions. In the right column of figure 20, one sees again the 
broad structure of the exclusive kx jet area distributions. Even though exclusive kx seems to 
successfully find the same top decay products as XCone, the jet shapes are rather different, 
and the high-mass shift is due to the large jet areas sometimes found by exclusive kx- 

The signal efficiency and background rnistag rates are shown in figure 21. The signal 
efficiency of XCone and exclusive kx are very similar across the whole px range, with exclusive 
kx even yielding better performance at low px- With exclusive kx, however, the W mass cut 
of 50 GeV is less effective at controlling backgrounds, yielding a higher background rnistag 
rate than XCone. This allows XCone to achieve higher signal significance than exclusive kx- 
That said, it may be possible to achieve the same performance gains of XCone by using 
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Figure 20: Same as figure 14, but with exclusive kx- The mass distributions in (a) and (c) 
are very similar to XCone for both signal and background, but the area distributions in (b) 
and (d) show that the exclusive kx jets have a more inconsistent shape than the XCone jets, 
as expected. 
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Figure 21: Same as figure 15, but with exclusive kx- The signal efficiency is similar between 
XCone and exclusive kx, but the background mistag is higher for exclusive kx, giving XCone 
the edge in terms of signal significance. 


a hybrid cone and kx clustering scheme. 12 For example, one could mitigate the high-mass 
shift coming from large area jets by running an N = 1 cone algorithm on each of the N = 6 
exclusive kx jets. Alternatively, one could first find the N = 6 hardest jets from an inclusive 
cone algorithm and then run N = 6 exclusive kx on the combined jet constituents. While 
perhaps not as elegant as XCone, this hybrid approach would still result in a fixed number 
of approximately conical jets. We leave a study of these hybrid methods for future work. 

B Background Considerations for Boosted Higgs 

In section 3, we showed that XCone yields excellent signal efficiency for Higgs reconstruction, 
with an approximately flat response in the case of N = 2. While a dedicated background study 
is beyond the scope of this paper, in this appendix we show that XCone gives sensible results 
when applied to one of the main background sources: Z+jets. We have not included the effect 
of 6-tagging, though we suspect that XCone would work well with subjet 6-tagging methods 
[46, 47]. We use Pythia 8.176 [27, 28] at the yfs = 14 TeV LHC to simulate pp —>• Z + jets, 
forcing the Z to decay to neutrinos. As a proxy for the Higgs px, we use the recoil px of 
the Z boson. We follow the same analysis strategy as in section 3, showing the background 
mistag rate for the N = 1, N = 2, and N = 3 + ISR veto strategies. 

As shown in figure 22 for N = 1, the reconstructed single jet mass increases for both 
XCone and anti -kx as a function of recoil Z px- In figure 23, this leads to a mistag rate that 
increases roughly linearly with px , as expected since the invariant mass of an ordinary QCD 
jet rises as a function of jet px■ This is part of the reason why jet substructure techniques 
like mass drop [35] are needed to control the background jet mass distribution. Just as in 
figure 9, XCone and anti -kx exhibit very similar performance. 

12 We thank Gavin Salam for discussions on this point. 
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Figure 22: Z + jets reconstruction for N = 1, as a potential background to associated Higgs 
production. As the recoil Z px increases from (a) 200 GeV to (b) 500 GeV to (c) 800 GeV, 
the single jet invariant mass increases, populating the Higgs signal region. See figure 8 for 
the corresponding signal study. 


Z + jets Mistag for N = 1 



Figure 23: Z +jets mistag rate for N = 1 boosted Higgs reconstruction as a function of Z 
recoil pr, with the mass window rrij € [100,150] GeV. The mistag rate grows with pr, with 
similar behavior seen between XCone and anti -kx- See figure 9 for the corresponding signal 
study. 
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Z + jets N = 2 (p > 200 GeV) Z + jets N = 2 (p > 500 GeV) Z + jets N = 2 (p > 800 GeV) 





Figure 24: Same as figure 22, but for N = 2. As the recoil Z p t increases from (a) 
200 GeV to (b) 500 GeV to (c) 800 GeV, the dijet invariant mass increases, since the two 
identified jets are unlikely to be proximate in phase space. Note that the scale on the x-axis 
has been increased compared to figure 22 to include larger jet masses. See figure 10 for the 
corresponding signal study. 
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Figure 25: (a) Z + jets mistag rate for N = 2 resolved dijet Higgs reconstruction as a 
function of Z recoil pr, with the mass window rrij € [100,150] GeV. The background rate is 
nearly constant as a function of pt , and similar between anti-fcr and XCone, showing that 
XCone does not unnecessarily sculpt background distributions, (b) Same as figure 11a, but 
combining N = 2 with N = 3, using the minimum pairwise mass to veto ISR. Again, we see 
a roughly flat mistag rate. See figure 11 for the corresponding signal study. 
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Turning to N = 2 in figure 24, the dijet invariant mass spectrum increases with recoil Z 
Pt, both for XCone and anti-Zc^. The reason is that there is no genuine 2-prong structure in 
the background, so it is far more likely for these algorithms to identify two widely separated 
jets at large invariant mass rather than two proximate jets at small invariant mass. In 
figure 25a, this leads to an approximately flat mistag rate as function of px of around 10- 
15%. As anticipated in the text, the combined N = 2,3 strategy in figure 25b has a higher 
mistag rate, though it still remains roughly constant as a function of px- Though anti-Zc^ 
has somewhat better mistag rates than XCone at high px, this is also where anti-Zc^ has low 
signal efficiencies (recall figure 11). 

Note that the above analysis did not include further attempts to mitigate backgrounds 
through 6-tagging, A r -subjettiness, or other substructure discriminant variables. Since the 
mistag rates for XCone and anti -kx are somewhat similar, we suspect that background miti¬ 
gation techniques used in current Higgs analyses can be adapted to XCone, though we leave 
a detailed study to future work. 
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